Quality versus quantity of training datasets for artificial intelligence-based whole liver segmentation
This study demonstrates that while highly curated, smaller datasets can achieve equivalent 3D segmentation performance to much larger mixed-curation datasets, the latter offers superior generalizability and local improvements, indicating that the optimal balance between data quality and quantity depends on specific training goals.